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(54) Title: METHOD FOR CHARACTERIZING NUCLEIC ACID MOLECULES 
(57) Abstract 

A method of characterizing a nucleic acid molecule is disclosed. The method comprises synthesizing DNA in the presence of a 
reaction mixture comprising a nucleic acid template, a primer molecule, an enzyme mat extends the primer so that a DNA molecule may be 
synthesized, four canonical deoxynucleoside triphosphates and at least one non-canonical deoxynucleoside triphosphate. The non-canonical 
deoxynucleoside triphosphate is incorporated into the synthesized DNA in place of a portion of only one canonical deoxynucleoside 
triphosphate. The synthesized DNA is treated with an N-glycosylase that excises a base portion of the non-canonical deoxynucleoside 
triphosphate from the synthesized DNA. The DNA is then treated in such a manner that the phosphodiester backbone of the DNA is broken 
at the abasic site, thus creating at least two DNA fragments. The fragments are separated according to size. 
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METHOD FOR CHARACTERIZING NUCLEIC ACID MOLECULES 

FIELD OF THE INVENTION 

The field of the present invention is methods of 
characterizing nucleic acid molecules. Specifically, the 
present invention concerns characterizing nucleic acid 
5 molecules by synthesizing DNA in the presence of a 

non-canonical deoxynucleoside triphosphate, excising the base 
portion of the non-canonical deoxynucleoside triphosphate, 
breaking the phosphodiester backbone of the DNA at the abasic 
sites, and analyzing the resulting DNA fragments. 



10 BACKGROUND OF THE INVENTION 

Methods of Characterizing Nucleic Acid Molecules 

There are many reasons for characterizing nucleic acid 
molecules. For example, genes are rapidly being identified 
and characterized which are causative or related to many 

15 human, animal and plant diseases. Even within any particular 
gene, numerous mutations are being identified that are 
responsible for particular pathological conditions . Thus, 
although many methods for detection of both known and unknown 
mutations have been developed (e.g., see Cotton, 1993), our 

20 growing knowledge of human and other genomes makes it 

increasingly important to develop new, better, and faster 
methods for characterizing nucleic acids. Besides diagnostic 
uses, improved methods for rapidly characterizing nucleic 
acids will also be useful in many other areas, including 

25 human forensics, paternity testing, animal and plant 
breeding, tissue typing, screening for smuggling of 
endangered species, and biological research. 

A variety of methods for characterizing DNA molecules 
are known in the art. For example, one can characterize DNA 

30 molecules by size based on their electrophoretic migration 
through an agarose or polyacrylamide gel. In these methods, 
the negatively charged DNA molecules move through a gel in 
the direction of the positively charged electrode. Provided 
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that the percentage of agarose or polyacryl amide in the gel 
is appropriate for the size range of the DNA molecules being 
electrophoreses smaller DNA molecules move through the pores 
of the gel more readily than larger DNA molecules. Because 
DNA molecules move in the gel at size -dependent rates, 
molecule sizes can be determined by staining and visualizing 
the DNA and then comparing the migration of the sample DNA 
molecules in the gel with the migration of marker DNA 
molecules of known size. Under the appropriate conditions 
single-stranded DNA molecules differing in length by even a 
single nucleotide can be distinguished by denaturing 
polyacrylamide gel electrophoresis. 

Another way to characterize DNA molecules is to treat 
each DNA molecule with one or more restriction endonucleases 
and then to determine the sizes of the various DNA fragments 
resulting from this treatment by agarose gel electrophoresis. 
Restriction endonucleases are enzymes that recognize specific 
sequences of bases in DNA (often 4, 5, 6 or sometimes 8 bases 
on each DNA strand) and then cut the phosphodiester bonds of 
the polynucleotide chains of DNA within to the recognition 
sequence. Because many restriction endonucleases with 
different recognition sequences are available, one can obtain 
a restriction map of an entire DNA molecule showing locations 
of restriction enzyme recognition sites and distances between 
them by determining which other restriction enzymes will cut 
each DNA fragment generated by any given restriction enzyme 
and what are the sizes of all of the resulting fragments. 
Such a restriction map is characteristic for a particular DNA 
molecule and can be used to obtain a rough identification of 
a particular sequence. Additionally, changes such as those 
caused by mutations in DNA may result in a loss or gain of a 
restriction site - a so-called "restriction fragment length 
polymorphism" (RFLP) (Kazazian, et al., 1989). An example of 
a diagnostically significant RFLP is a single base mutation 
in the beta-hemoglobin gene, the change from A to T which 
eliminates a Dde I restriction site, which results in sickle 
cell anemia (Kazazian, et al . , 1989). 1 

One of the most informative ways to characterize I DNA 
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molecule is to determine its nucleotide sequence. One method 
for sequencing DNA (Maxam and Gilbert, 1977) is accomplished 
by treating each of four aliquots of one strand of a 5'- or 
3 ' -end-labelled DNA molecule to be sequenced with one of four 
5 different chemical reagents. One chemical specifically 

modifies only the guanine base in the DNA, another modifies 
only cytosine, another modifies either guanine or adenine 
bases, and the last chemical modifies either thymine or 
cytosine bases. The chemical treatments are carried out 

10 under conditions so that only a small proportion of the total 
susceptible bases will actually be modified. 

It is important that the chemical reactions are limited 
in order to generate a nested set of fragments differing by 
one base of a specific type. If all G residues, for example, 

15 were modified, the residues would all be susceptible to 
phosphodiester bond cleavage. Therefore, a collection of 
partially modified nucleic acids is required for sequencing. 
Subsequent treatment with piperidine results in cleavage of 
the phosphodiester bonds of the DNA molecule at the abasic 

20 sites, generating a mixture of all sizes of DNA molecules 

that are possible following chemical modification and loss of 
each one of the corresponding susceptible bases. The DNA 
molecules in each of the four reactions are then resolved by 
electrophoresis in adjacent lanes of a polyacrylamide gel and 

25 the pattern of bands is revealed by exposing the gel to X-ray 
film if the DNA molecules are labelled with a radioisotope. 
The sequence of the DNA is revealed by analyzing the exposed 
X-ray film. Alternatively, if DNA molecules are labelled with 
a fluorescent, chemiluminescent or some other non- radioactive 

30 moiety, the sequence is revealed by an appropriate method 
known in the art . 

The most commonly used method for sequencing DNA at 
this time (Sanger, et al., 1977) uses a DNA polymerase to 
produce differently sized fragments depending on the 

35 positions (sequence) of the four canonical bases (A = 

Adenine; C = Cytidine; G = Guanine; and T = Thymine) within 
the DNA co be sequenced. In this method, the DNA to be 
sequenced is used as a template for iif vitro DNA synthesis. 
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In addition to all four of the canonical deoxynucleotides 
(dATP, dCTP, dGTP and dTTP) , a 2 ' , 3 • -dideoxynucleotide is 
also included in each in vitro DNA synthesis reaction at a 
concentration that will result in random substitution of a 
5 small percentage of a canonical nucleotide by the 

corresponding dideoxynucleotide. Thus, each DNA synthesis 
reaction yields a mixture of DNA fragments of different 
lengths corresponding to chain termination wherever the 
dideoxynucleotide was incorporated in place of the normal 

10 deoxynucleotide. The DNA fragments are labelled, either 

radioactively or non-radioactively, by one of several methods 
and the label (s) may be incorporated into the DNA by 
extension of a labelled primer, or by incorporation of a 
labelled deoxy or dideoxy nucleotide. By carrying out DNA 

15 synthesis reactions for each of the four dideoxynucleotides 
(ddATP, ddCTP, ddGTP or ddTTP) , then separating the products 
of each reaction in adjacent lanes of a denaturing 
polyacrylamide gel, and detecting those products by one of 
several methods, the sequence of the DNA template can be read 

20 directly. 

Cycle Sequencing is a variation of Sanger sequencing 
that achieves a linear amplification of the sequencing signal 
by using a thermostable DNA polymerase and repeating chain 
terminating DNA synthesis during each of multiple rounds of 
25 denaturation of a template DNA (e.g., at 95°C) , annealing of 
a single primer oligonucleotide (e.g., at 55°C) , and 
extension of the primer (e.g., at 70°C) . 

Nucleic acid sequencing provides the highest degree 
of certainty as to the identity of a particular nucleic acid. 
Also, nucleic acid sequencing permits one to detect mutations 
in a gene even if the site of the mutation is unknown. 
Sequencing data may even provide enough information to permit 
an estimation of the clinical significance of a particular 
mutation or of a variation in the sequence. 

In order to characterize a nucleic acid by sequencing, 
the nucleic acid must, be isolated in sufficient quantity to 
be used for the particular method. Although it may be 
possible to obtain sufficient quantities of a nucleic acid 
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for sequencing by first cloning it into a plasmid or other 
vector, this procedure is time-consuming and is often not 
practical for routine analysis of samples for clinical 
diagnostics or other purposes. When the amount of nucleic 
5 acid in a sample is less than optimal for a given method, it 
may be advantageous to use one of several methods which have 
been developed for amplifying parts of nucleic acid 
molecules. The polymerase chain reaction (PCR) , nucleic acid 
sequence -based amplification (NASBA) , self -sustained sequence 

10 replication (3SR) , transcription-mediated amplification (TT4A) 
and strand displacement amplification (SDA) are examples of 
some of the methods which have been developed for amplifying 
nucleic acid molecules in vitro. 

By way of example, a specific portion of a DNA 

15 molecule may be amplified using PCR by temperature cycling of 
a sample DNA in a buffer containing two primers (one primer 
complementary to each of the DNA strands and which, together, 
flank the DNA sequence of interest) , a thermostable DNA 
polymerase, and all four canonical 2 ' -deoxynucleoside-5' - 

20 triphosphates (dATP, dCTP, dGTP and dTTP) . The specific 

nucleic acid sequence is geometrically amplified during each 
of about 30 cycles of denaturation (e.g., at 95°C) , annealing 
of the two primers (e.g., at 55°C) , and extension of the 
primers by the DNA polymerase (e.g., at 70°C) , so that up to 

25 about a billion copies of the nucleic acid sequence are 
obtained. RNA may be similarly amplified using one of 
several protocols for RT-PCR, such as, for example, by 
carrying out the reaction using a thermostable DNA polymerase 
which also has reverse transcriptase activity (Myers and 

30 Gelfand, 1991) . 

The polymerase chain reaction (PCR) , as discussed 
above, is the subject of numerous publications, including 
Mullis, KB, et al., U.S. Pat. No. 4,683,202 and U.S. Pat. No. 
4,683,195; Mullis, KB, EP 201,184; Ehrlich, H. , EP 50,424, EP 

35 84,796, EP 258,017, & EP 237,362; Ehrlich, H. , U.S. Pat. No. 
4,582,788; Saiki , R. , et al., U.S. Pat. No. 4,683,202; 
Mullis, KB, et al . (1986) in Cold Spring Harbor Symp. Quant . 
Biol. 51:263; Saiki, R-, e£ al. (1985) Science 230:1350; 
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Saiki, et al. (1985) Science 231:487; and Loh, EY, et al. 
(1988) Nature 335:141. 

By way of a second example, all or a specific portion of 
an RNA molecule may be amplified using NASBA (Fahy, et al , 
1991) by isothermal incubation of a sample RNA in a buffer' 
containing two primers (a first primer complementary to the 
RNA molecule and encoding a promoter sequence for an RNA 
polymerase and a second primer complementary to the 3 '-end of 
the first cDNA strand resulting from reverse transcription of 
the RNA molecule) , an RNA- and DNA-dependent DNA polymerase 
which also has RNase H activity (or a separate RNase H 
enzyme), all four canonical 2 • -deoxynucleoside-5 ' - 
triphosphates (dATP, dCTP, dGTP and dTTP) , an RNA polymerase 
that recognizes the promoter sequence of the first primer, 
and all four ribonucleoside-5 ' -triphosphates (rATP, rCTP, ' 
rGTP and rUTP) . 

A first cDNA strand is synthesized by extension of the 
first primer by reverse transcription. Then, the RNase H 
digests the RNA of the resulting DNA: RNA hybrid, and the 
second primer primes synthesis of the second cDNA strand. 
The RNA polymerase then transcribes the resultant double- 
stranded DNA (ds-DNA) molecule from the RNA polymerase 
promoter sequence, making many more copies of RNA, which in 
turn, are reverse- transcribed into cDNA and the process 
begins all over again. This series of reactions, from ds-DNA 
through RNA intermediates to more ds-DNA, continues in a 
self-sustained way until reaction components are exhausted or 
the enzymes are inactivated. DNA samples can also be 
amplified by other variations of NASBA or 3SR. 

Strand Displacement Amplification (SDA) is another 
isothermal nucleic acid amplification technique (Walker, 
1994) . SDA is a method of nucleic acid amplification in 
which extension of primers, displacement of single stranded 
extension products, annealing of primers to the extension 
products (or the original target sequence) and subsequent 
extension of the primers occurs concurrently in the reaction 
mix. This is in contrast to the PCR, in which the steps of 
the reaction occur in discrete phases or cycles as a result 
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of the temperature constraints of the reaction. SDA is based 
upon 1) the ability of a restriction endonuclease to nick the 
unmodified strand of a hemiphosphorothioate form of its 
double- stranded recognition site and 2) the ability of 
5 certain polymerases to initiate replication at the nick and 
displace the downstream non-template strand. After an 
initial incubation at increased temperature (about 95°C) to 
denature double -stranded target sequences for annealing of 
the primers, subsequent polymerization and displacement of 

10 newly synthesized strands takes place at a constant 

temperature (usually about 37°C) . Production of each new 
copy of the target sequence consists of five steps: 1) 
binding of amplification primers to an original target 
sequence or a displaced single-stranded extension product 

15 previously polymerized, 2) extension of the primers by 

exonuclease deficient (exo) klenow polymerase incorporating 
an a-thio deoxynucleoside triphosphate, 3) nicking of a 
hemiphosphorothioate double -stranded restriction site, 4) 
dissociation of the restriction enzyme from the nick site, 

20 and 5) extension from the 3 '-end of the nick by exo" klenow 
with displacement of the downstream non-template strand. 
Nicking, polymerization and displacement occur concurrently 
and continuously at a constant temperature because extension 
from the nick regenerates another nickable restriction site. 

25 When primers which hybridize to both strands of a double - 
stranded target sequence are used, amplification is 
exponential, as the sense and antisense strands serve as 
templates for the opposite primer in subsequent rounds of 
amplification. 

30 PCR, NASBA and the other methods of nucleic acid 

amplification can be very useful for obtaining greater 
quantities of a nucleic acid for additional characterization. 
However, in general, the amplified nucleic acid molecules 
must be purifi d away from primers, nucleotides, incomplete 

35 amplification products and other impurities prior to being 

used for sequencing. Otherwise, for example, the PCR primers 
may compete with labelled sequencing primers and th^J PCR 
nucleotides may compete with the sequencing nucleotide mixes 
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that are used for Sanger dideoxy sequencing. Also, Sanger 
sequencing clearly can not be done at the same time as PCR or 
another amplification method, at least not efficiently, 
because the dideoxynucleotides used for sequencing will 
result in termination of the DNA amplification reactions as 
well as the sequencing reactions. 

One group has developed a method that attempts to 
decrease the number of steps required for sequencing nucleic 
acids (Shaw and Porter, PCT WO 95/06752). According to this 
method, 5'-alpha-borano-deoxynucleoside triphosphates, which 
were found to be resistant to exonuclease III (exo Hi) 
digestion, were incorporated into DNA during in vitro DNA 
synthesis in lieu of one of the canonical nucleotides (dATP 
dCTP, dGTP, dTTP) in one of four primer extension reactions! 
Treatment with exo III will digest the synthesized DNA up to 
the point of alpha-borano deoxynucleoside incorporation. 
After digestion with exo III and resolution of the labeled 
fragments on a polyacryl amide gel, the sequence of the 
nucleic acid can determined. 

An advantage of the alpha-borano/exo III method is that 
it can be integrated into PCR amplification. However, there 
are also disadvantages. A key disadvantage is a lower degree 
of accuracy of the sequence data compared to Maxam-Gilbert or 
Sanger sequencing because the alpha-borano/exo III method 
gives both extra bands and missing bands on sequencing gels. 
Although the mechanism by which these sequencing artifacts 
are generated is still uncertain, the extra bands may be due 
to incomplete digestion of non-boronated regions of DNA by 
exo III, while missing bands may be due to preferential 
digestion by exo III through some sequences containing alpha- 
borano- nucleotides . Another serious disadvantage of the 
alpha-borano/exo III method is related to the substrate 
requirements of exo III. Because exo III digests only 
double-stranded DNA, beginning at the 3' -end of each strand, 
the sequence can only be determined for the 3 '-half of each' 
strand of a PCR product, so it is not possible to obtain the 
sequence near to the primer. Also, because exo III digestion 
yields only fragments that are between <lo% and 100% of the 
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length of the full-size PCR product, the size range of the 
DNA which can be sequenced by the method is somewhat limited. 
For example, if a PCR product of 1000 base pairs in length is 
sequenced according to the alpha- bo rano/exo III method, the 
5 fragments to be electrophoresed would be approximately 500- 
1000 nucleotides long. Fragments of such a length are more 
difficult to resolve in DNA sequencing gels. 

Uracil N-Glvcosvlase 

"Uracil N-glycosylase" or "uracil -DNA glycosylase" (UNG 

10 or UDG) is an enzyme that catalyzes the cleavage of the N- 
glycosidic bond between the base uracil and the sugar 
deoxyribose in DNA into which the non-canonical nucleotide 
2' -deoxyuridine- 5' -triphosphate (dUTP) has been incorporated 
in place of the canonical nucleotide dTTP (Lindahl, 1979) . 

15 UDG does not catalyze cleavage of uracil from free dUTP, free 
deoxyuridine or RNA (Duncan, 1981) . 

U.S. patent 5,035,996 describes a process in which UDG 
is used for controlling contamination of nucleic acid 
amplification reactions (Hartley, U.S. Pat. No. 5,035,996). 

20 The purpose of this invention was to pretreat reaction 

mixtures containing a new sample on which an amplification is 
to be carried out with UDG to assure that any uracil - 
containing DNA from a prior amplification of another sample 
had been destroyed and would not contaminate the second 

25 amplification reaction. Digesting the second amplification 
mixture with UDG prior to carrying out the second 
amplification reaction destroys the ability of any residual 
products of the first amplification from serving as a 
template for further amplifications. 

30 Somewhat similarly, U.S. patent 5,418,149 discloses the 

use of glycosylases to reduce non-specific amplification of 
nucleic acids. 

A method for introducing site-specific mutations into 
DNA has also been described that relies upon replacement of 

35 thymine with uracil in DNA and subs quent treatment with 
uracil -DNA glycosylase. :See U.S. patent 4,873,192 and 
Kunkel, 1985. Also, uracil -containing phage were suggested 
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as a part of a biological containment system that would 
transfer genetic information only to uracil -N glycosylase 
deficient cells and not to naturally occurring bacteria. See 
Warner, g£. al. , 1979. 

Another use for UDG in molecular biology has been 
described by Nisson, et al. , 1991. In Nisson( et al. udg is 
used to facilitate directional cloning of PCR products. 
Thus, primers to be used for PCR amplification are made to 
contain a specific 12 -base 5' sequence that contains dUMP in 
place of dTMP. Then, after the PCR amplification (without 
dUTP in the reaction mixture) , since the amplification 
products contain dUMP residues at each 5' terminus, treatment 
of these PCR products with UDG removes the uracil residues 
from the 5' termini. Subsequent treatment with heat results 
in cleavage of the phosphodiester bonds at the abasic sites 
where uracil has been lost and thereby, generates 12 -base 
cohesive termini which can be easily cloned into vectors with 
complementary termini . 

What is needed in the art is a method for characterizing 
nucleic acids that is as accurate and as specific as Maxam- 
Gilbert or Sanger sequencing in detecting and identifying 
nucleic acids and differences between nucleic acids, but that 
is easier, faster, more sensitive and/or requires less sample 
DNA. The new method should also be capable of being used for 
relatively impure nucleic acid samples, such as amplification 
products, without needing to purify the sample away from 
primers, nucleotides or other impurities. The method also 
should be capable of being_integrated into an amplification 
method such as PCR in a way similar to the alpha -borano/exo 
III method, but without having the disadvantages of the 
latter method. 



SUMMARY OP THE INVENTION 

The present invention provides a simple, reliable method 
for characterizing nucleic acid polymers. in one embodiment, 
the invention permits the detection and localization of 
mutations' in nucleic acid polymers. In another embodiment, 
the present invention allows one to determine whether nucleic 
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acid molecules are similar or different and the degree of 
similarity or difference. In another embodiment, the 
invention permits one to determine the sequence of a nucleic 
acid molecule. Because the sequence - delimiting non-canonical 
5 nucleotides used for the method are not chain terminators for 
DNA synthesis, the method may also be part of or coupled to 
an amplification method such as PCR, NASBA, or SDA. 

In the present invention, a DNA molecule is synthesized 
in the presence of a nucleic acid template, a primer, a 
10 polymerase and a non-canonical deoxynucleoside triphosphate 
as well as all four canonical deoxynucleoside triphosphates. 
The in vitro synthesized DNA is then treated with an enzyme 
that excises the non-canonical base from the DNA, whereby an 
abasic site is created. One then breaks the phosphodiester 
15 bonds of the synthesized DNA at the abasic sites and a series 
of molecules of different lengths is generated. 

Each size class of molecule corresponds to the site 
where a non-canonical nucleotide substituted for the 
canonical nucleotide within the original sequence. Size 
20 resolution of the nested sets of cleaved DNA molecules by gel 
electrophoresis, capillary electrophoresis, or another method 
shows the position of individual substitutions by non- 
canonical nucleotides . 

In one embodiment, the non-canonical deoxynucleoside 
25 triphosphate is dUTP. The enzyme that excises the non- 
canonical residue is preferably an N-glycosylase, such as 
uracil N-glycosylase . 

In one embodiment of the present invention, the 
nucleotide sequence of a DNA molecule is deduced by creating 
30 four different reactions in which four different non- 
canonical deoxynucleoside triphosphates substitute for a 
portion of each of the canonical deoxynucleoside 
triphosphates and comparing the products of the reactions. 

It is an object of the present invention to compare 
35 nucleic acid molecules at the nucleotide level. 

It is another object of the present invention to provide 
a method to detect mutations. 

It is another object of the present invention to provide 
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a method of incorporating a non-canonical deoxynucleoside 
triphosphate that may be compatible with amplification 
procedures . 

It is another object of the present invention to provide 
a method of determining the sequence of a nucleic acid 
molecule . 

It is another object of the present invention to provide 
a method to directly sequence amplification products. 

It is an advantage of the present invention that the 
method is accurate and reproducible. 

It is another advantage of the present invention that 
the method does not result in artificial results, such as 
missing or duplicated bands. 

Other advantages, features and objects of the present 
invention will become apparent after examination of the 
specification, claims and drawings. 

DESCRIPTION OF THE DRAWINGS 

Figure 1 is a schematic diagram of the BESS method. 
Figure 2 is a schematic diagram showing the use of BESS 
to evaluate specific mutation events. 

Figure 3 is a schematic diagram showing coupled PCR and 

BESS. 

Figure 4 is a schematic diagram of DNA sequencing using 

BESS. 

Figure 5 is a schematic diagram showing the use of BESS 
to determine if and to what extent nucleic acid molecules are 
similar or different. 



DETAILED DESCRIPTION OF THE INVENTION 

Definite nna - 

The term "mutation", as used herein, refers to 
alterations in the nucleotide composition of a nucleic acid, 
and may consist of deletions, insertions, single and multiple 
residue changes in nucleic acid sequence. These changes 
normally consist of the substitution of a nucleotide residue 
by another species of commonly-occurring (canonical) 
nucleotide . I 
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"Nucleotide" refers to a base-sugarphosphate compound. 
Nucleotides are the monomeric subunits of both types of 
nucleic acid polymers, RNA and DNA. "Nucleotide" refers to 
ribonucleoside triphosphates, rATP, rGTP, rUTP and rCTP, and 
5 deoxyribonucleoside triphosphates, such as dATP, dGTP, dTTP, 
dCTP. "Nucleoside" refers to a base-sugar combination 
without a phosphate group. "Base" refers to the nitrogen- 
containing base, for example adenine (A) , cytidine (C) , 
guanine (G) and thymine (T) . 

10 "Incorporation" refers to becoming a part of a nucleic 

acid polymer. There is a known flexibility in the 
terminology about incorporation of nucleic acid precursors. 
For example, the nucleotide dGTP is a deoxyribonucleoside 
triphosphate. Upon incorporation into DNA, it becomes a 

15 dGMP, or deoxyguanosine monophosphate moiety. Although there 
is no dGTP molecule in DNA, one may say that one incorporates 
dGTP into DNA. 

The term "canonical" is used to refer to the four common 
nucleic acid bases adenine, cytosine, guanine and thymine 

20 that are commonly found in DNA or to the respective 
deoxyribonucleosides, deoxyribonucleotides or 2'- 
deoxyribonucleoside-5 ' -triphosphates that contain a canonical 
base. The term "non- canonical" is used to refer to nucleic 
acid bases in DNA other than the four canonical bases, or to 

25 the respective deoxyribonucleosides, deoxyribonucleotides, or 
2 ' -deoxyribonucleoside- 5' -triphosphates that contain a non- 
canonical base. Although uracil is a common nucleic acid 
base in RNA, uracil is a non-canonical base in DNA. "Non- 
canonical bases" are found in nucleic acids as a result of 

30 incorporation of non-canonical nucleotides or as a result of 
modification of existing bases (canonical or non- canonical) . 

The term "amplicon" has come to describe the product of 
amplification by the PCR or by other nucleic acid 
amplification techniques known to those skilled in the art. 

35 All temperatures described herein are expressed in 

degrees centrigrade, unless otherwise specified. 

i 

In General 
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We disclose a new method, referred to as "Base Excision 
Sequence Screening" or "BESS™", for characterizing nucleic 
acids. Figure 1 describes an embodiment of BESS. 

The BESS method is based on the incorporation of non- 
canonical nucleotides into DNA during in vitro DNA synthesis 
using the nucleic acid to be characterized as a template. 
Referring to Figure 1, a labelled primer is annealed to a 
nucleic acid template in the presence of a polymerase capable 
of extending the primer, all four canonical deoxynucleoside 
triphosphates and a non-canonical nucleotide. The specific 
non-canonical nucleotide demonstrated as an example in Figure 
1 is dUTP. Following DNA synthesis, each non-canonical base 
is excised from DNA by an enzyme specific for that base. In 
the Figure 1 example, this enzyme is uracil DNA glycosylase. 

Still referring to Figure 1, the DNA chain is then 
broken at the abasic sites corresponding to the positions of 
each non-canonical nucleotide. In Figure 1, the example 
demonstrated was the use of an AP endonuclease, Endonuclease 
IV, to break the DNA at abasic sites. Then, separation of 
the resulting DNA fragments yields patterns of fragments that 
end at the positions where non-canonical bases were removed. 
The positions of each non-canonical nucleotide within the in 
vitro- synthesized DNA correspond to the positions of a 
canonical nucleotide within the nucleic acid being 
characterized. 

BESS, even when applied to a single non-canonical 
nucleotide, may be used to obtain a sequence- related pattern 
of DNA fragments in order to characterize either the presence 
or absence of a particular nucleic acid sequence or its 
relatedness to another nucleic acid sequence (see Figs. 2 and 
5) . Because non-canonical nucleotides used for BESS do not 
terminate the DNA chain, BESS may be part of an amplification 
process such as PCR, RT-PCR, NASBA, or SDA, or any other 
method involving in vitro DNA synthesis (see Figure 3). BESS 
may also be used to determine the complete nucleotide 
sequence of a nucleic acid molecule (see Figure 4) . A 
preferred method for DNA synthesis during the BESS reaction 
is described in PCR Technology. Griffin and Griffin, eds., 
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CRC Press, Boca Raton, FL (1994) . The preferred minimal 
components for synthesis of DNA are: a template DNA (double- 
or single- stranded) , a primer oligodeoxynucleotide of a 
sequence which is the reverse complement of a portion of the 
5 template, the four canonical deoxynucleoside triphosphates at 
a concentration of 50 - 250 micromolar, a buffer of pH 6.5 - 
9.0, magnesium ions at 1 - 10 mM, and a DNA polymerase 
enzyme. The concentration of the non-canonical 
deoxynucleoside triphosphate and its proportion in the 

10 reaction mixture with respect to the corresponding canonical 
nucleotide are determined empirically for each non-canonical 
nucleotide and DNA polymerase combination. The present 
invention requires the incorporation of a quantity of non- 
canonical nucleotide such that, for any given site along the 

15 polynucleotide, only a fraction of all molecules will be 
cleaved at that site by an N-glycosylase enzyme. The non- 
canonical nucleotide may be a specific substitute for any of 
the four common nucleotide components for synthesizing DNA: 
deoxythymidine triphosphate, deoxy adenosine triphosphate, 

2 0 deoxycytosine triphosphate, or deoxyguanosine triphosphate. 
Some preferred methods are described in the Examples below. 

In the BESS method, the incorporation of the non- 
canonical nucleotide occurs during a standard DNA synthesis 
reaction. Either a DNA or RNA molecule may serve as the 

25 template. Those of skill in the art know of many 

modifications of this reaction. Preferably, the reaction is 
performed with a labelled primer containing a nucleotide 
sequence at least part of which is sufficiently complementary 
to a portion of the template molecule to hybridize with the 

30 template. If the template is a DNA molecule, preferably a 
DNA-depefident DNA polymerase extends the primer molecule. 
Most preferably, this DNA- dependent DNA polymerase is a 
thermostable DNA-dependent DNA polymerase. If an RNA 
molecule is the template, an RNA-dependent DNA polymerase 

35 (reverse transcriptase) is used. Preferably, this enzyme is 

thermostable . 

i 

A r^dioactively- labelled primer enables one to visualize 
the reaction product by exposing the separated reaction 
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products to an X-ray film. Alternatively, one may label the 
primer and detect the reaction products by non-radioactive 
methods known to one of skill in the art. 

dUTP as the Non-Canonical Nucleotide. 
5 In one embodiment of the invention, the non- canonical 

nucleotide dUTP is incorporated into a DNA sequence during 
DNA synthesis. The BESS method and some of its uses are 
illustrated in the figures using dUTP as an exemplary non- 
canonical nucleotide and uracil -DNA glycosylase as an 

10 exemplary N-glycosylase . For example, a specific mutation 
involving the loss or gain of a deoxythymidine (dT) residue 
may be detected after the incorporation of deoxyuridine (dU) 
into the DNA. Digestion of the dU-substituted sample with 
UDG and hydrolysis of the phosphodiester bonds at the abasic 

15 sites (points of dU removal) leads to a nested set of 
fragments ending at each of the abasic sites. In this 
embodiment, the loss or gain of a dT residue by mutation must 
occur on the same strand of the nucleic acid which will be 
detected following gel electrophoresis or another separation 

20 method. 

The incorporation may be during a PCR or another 
amplification involving DNA synthesis known to the art. A 
mixture of deoxynucleotides dGTP, dCTP, dATP, dTTP and dUTP 
is used as the precursor pool for incorporation into newly 

25 synthesized DNA. Although the nucleotide incorporated 

opposite a dA on the template strand is dT for the majority 
of the synthesized DNA molecules, dU constitutes the 
nucleotide at any such position for a fraction of the DNA 
molecules synthesized. 

30 The synthesized nucleic acid is then treated with an N- 

glycosylase, such as UDG, which removes only uracil from such 
sites where it has been incorporated as dU. This removal 
creates an "abasic" site. Because uracil is present in only 
a fraction of the possible incorporation sites, the nucleic 

3 5 acid is cleaved by UDG at that position in a fraction of the 
1 molecules. If dU is randomly incorporated into the nucleic 

l| acid at sites normally containing dT, UDG digestion and 
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phosphodiester bond scission generates a collection of 
molecules differing in length by the distance between dT 
residues in the original nucleic acid sequence. Such a 
nested set of molecules may then be resolved by 
5 electrophoretic separation means, and the precise location of 
the dT residues in the original nucleic acid may be inferred. 

Mutation Analysis 

Figure 2 describes the use of the BESS technique to 
examine specific mutations. Specifically, Figure 2 shows an 

10 example in which a mutation results from substitution of a 
single T residue in wild-type DNA by a C residue in mutated 
DNA. This change is detected using BESS as the loss of band 
on a PAGE gel for the sample with mutated DNA compared to the 
sample with wild- type DNA. 

15 Referring to Figure 2 both the wild- type DNA and the 

mutated DNA are denatured and annealed with labelled primers. 
The strand complementary to the one containing the T-to-C 
transition serves as the template for DNA synthesis. An 
appropriate non- canonical nucleotide is incorporated during 

20 the DNA synthesis reaction in place of a small portion of a 
canonical nucleotide. Figure 2 describes the products of the 
wild-type template and a template with an A-to-G mutation. 

Still referring to Figure 2, the products are exposed to 
an appropriate N-glycosylase. The N-glycosylase will excise 

25 the non-canonical base at all sites in which it is 

incorporated. The DNA is then treated so as to cleave it at 
the abasic sites and the fragments are separated. Figure 2 
illustrates the result of comparing the labelled reaction 
products from the wild- type samples and the labelled reaction 

30 products from the mutated samples. Multiple fragments will 
be obtained as a result of cleavage at all abasic sites where 
a non-canonical base was removed by an N-glycosylase. The 
example shown in Figure 2, depicts only the portion of the 
electrophoretic gel where a mutation is observed. 

35 If the above process is carried out using either (1) two 

PCR primers with different detection labels or (2) two 1 
separate PCR mixtures, each carried out with a different one 
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of the two PCR primers labelled, the position of dT in both 
strands of DNA may be determined. 

As seen in Table 1, ten out of twelve possible base 
substitution mutations of DNA involve the loss or gain of a T 
on one or the other strand. Therefore, an embodiment of the 
invention using dUTP as the non-canonical nucleotide can be 
used to locate most point mutations. Table 1 compares 
various mutations and detection of the mutation after UDG 
digestion. 



Table 1 

Detection of single-base substitution mutations usinq two 



MU TATION IN 
SENSE STRAND 


iiENiik STkAND " 

PRIMER 


ANTI-SENSE 
PRIMER 


Mutation 

DETECTED 
after UDG 


A^>TJ 

A — > C 
A — > T (U) 
G— > C 


gainT 


lose I 
lose T 
lose T 


YhS 
YES 
YES 
NO 
YES 
YES 
YES 
YES 
NO 
YES 
YES 
YES 


G— > T (U) 
G— > A 
C— > T (U) 
C— > A 
C— > G 

(U) T~> A 

(U) T~> G 

(U) T~> C 


gainT 
gainT 

lose T 
lose T 
lose T 


gain T 
gain T 
gain T 




6/12 detected 
with one primer 


6/12 detected 
with one primer 


10/12 detected 
with two primers 



Thus, ten out of twelve possible point mutations may be 
determined with great sensitivity by coupling an embodiment of 
the present invention using dUTP incorporation and UDG 
digestion and PCR with two different labelled primers. With 
one primer labeled, one -half (6 out of 12) of the possible 
mutations could be determined using this e-mbodiment of the 
invention. The detection of the other two mutations, dG to dC 
and dC to dG, requires the use of non-canonical nucleotides 
that specifically substitute for dG or for dC, and N- 
glycosylases which remove each respective non-canonical base. 
Both of these mutations could be detected using only one non- 
canonical nucleotide, substituting for dGTP or dCTP, and one 
corresponding N-glycosylase if two different labelled primers 
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were used (one for priming each strand of a DNA template) . 

Use of Other Non-Canonical Nucleotides 

For embodiments of the invention involving non-canonical 
nucleotides other than dUTP # it is necessary that one select 
N-glycosylases which will selectively cleave the respective 
non- canonical base from the synthesized DNA strand. There are 
other N-glycosylases known to the art (Demple and Harrison, 
1994 and Lindahl, 1979) which may be utilized to remove 
specific non-canonical nucleotides from DNA. Table 2 lists 
some of these enzymes. 



Table 2 



CANONICAL 
BASE in DNA 


NOW- 
CANONICAL 
NUCLEOTIDE 


NON- 
CANONICAL 
NUCLEOTIDE 
REFERENCE 


6LYC03VLA5E 


SOURCE OF 

GLYCO- 

SYLASE 


GLYCO- 
SYLASE 
REFERENCE 


T (tnynmne) 


dUTP 

(deoxyuridine 
triphosphate) 


Bessmans, et 
at.. 1558 


UDGorUNG 




Lindahl. 1974 


G (guanine) 


dITP 

(deoxyinosine 
triphosphate) 


Thomas, et al., 
1976 


HXN5 

(hypoxanthine- 
N-glyoosylase) 


a) calf thymus 

b) £. coTt 


Karran and 
Lindahl. 1980 
Karran and 
Lindahl. 1978 


C (cytosine) 


5-OHMe-dCTP 
(5- 

hydroxymethy) 
deoxycytidine 
triphosphate) 


Stahl and 

Charnbertin, 

1976 


hydroxy-methyl 

cytosine-N- 

glyoosytase 


calf thymus 


Cannon, et al., 
1988 



A non-canonical deoxynucleotide of the present invention 
must be incorporated into DNA with fidelity and must be 
compatible as a component of product DNA during subsequent 
rounds of DNA synthesis* That is, the non-canonical 
deoxynucleotide must not either terminate the extension of the 
product DNA strand or cause the polymerase to falter or result 
in a mutation on the complementary strand. In some cases, the 
presence of an unusual non-canonical nucleotide may alter the 
DNA synthesis reaction or the physical properties of the DNA 
into which it is incorporated so that DNA synthesis parameters 
need to be optimized. 

There are several other general requirements envisioned 

for the invention. The non-canonical nucleotide must be 

i 

obtainable by some means,; such as by chemical synthesis, by 
degradation of polynucleotides, by enzymatic modification of 
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nucleotides, or by a combination of these processes, and the 
product of said process or processes must then be converted 
into a deoxynucleoside triphosphate. Additionally, there must 
be a specific N-glycosylase to excise the non-canonical base 
from a nucleic acid into which it is incorporated. 

There are variations of the invention envisioned which 
allow for the position of canonical nucleotides to be 
specifically determined without direct incorporation during 
DNA synthesis of the non-canonical nucleotide recognized by 
the N-glycosylase. Thus, canonical or non-canonical 
nucleotides in DNA are first specifically converted to non- 
canonical nucleotides which are specific substrates for 
appropriate glycosylase enzymes. Such a conversion to a non- 
canonical nucleotide recognized specifically by an N- 
glycosylase may be accomplished by chemical or enzymatic 
means. The phosphodiester bonds at the abasic sites generated 
by glycosylase activity are then broken as described herein. 
For example, DNA with abasic sites may be prepared by 
deamidation of cytosine followed by treatment with UNG (Sagher 
and Strauss, 1985) . 



Use of N-Glvcosvlases 

Other specific N-glycosylases will become available and 
known to those of skill in the art. in order to determine 
whether or not an N-glycosylase is suitable for the present 
invention, one would first incorporate the non- canonical 
nucleotide into the synthesizing DNA and determine whether or 
not the non-canonical base can be specifically removed by the 
candidate N-glycosylase. The example above, describing the 
use of UDG, would serve as a comparative control. 

By "N-glycosylase" or "DNA-glycosylase" we mean an enzyme 
with N-glycosylase activity, whether or not the enzyme is 
formally called a glycosylase or has a glycosylase activity 
combined with other enzymatic activities. Glycosylases are 
sometimes referred to as "glycosidases, « and we therefore mean 
the definition of N-glycosylase to cover N-glycosidases . 

As defined herein, an N-glycosylase or DNA glycosylase is 
an enzyme that catalyzes hydrolysis of the bond between a non- 
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canonical nucleic acid base and a sugar in DNA to generate an 
abasic (AP) site. Such enzymes are present in many species. 
An example from Escherichia coli is uracil -DNA glycosylase 
(UDG) , also called uracil N-glycosylase (UNG) . Other examples 
5 are described by Demple and Harison (1994) and by Duncan 
(1981) . 

Cleavage at the Abasic Site 

Once one has created an abasic site, the method of the 
present invention requires that the synthesized DNA strand be 

10 cleaved at this site. There are various methods known to 
those of skill in the art to cleave an abasic site. Heat 
and/or basic conditions may be used to break the DNA molecule 
at the abasic sites. For example, the following protocol may 
be used: Nucleic acids containing abasic (AP) sites following 

15 removal of non-canonical bases are heated in a buffer solution 
containing an amine, for example, 2 5 mM Tris-HCl and 1 to 5 mM 
magnesium ions, for a period of 10 to 3 0 minutes at 70 °C - 
95°C. Alternatively, the following treatment may be used to 
break the DNA at abasic sites: 

20 1.0 M piperidine, a base, is added to DNA which has been 

precipitated with ethanol and vacuum dried. The solution is 
then heated for 30 minutes at 90 °C and lyophilized to remove 
the piperidine. 

Preferably, enzymatic treatment is used to break the DNA 

25 polymer at the abasic site. For example, apurinic/ 

apyrimidinic endonucleases (AP endonucleases) have been 
described which are able to cleave the phosphodiester bonds of 
DNA at apurinic or apyrimidinic (AP) sites (Lindahl, 1979; 
Demple and Harrison, 1994) . 

30 The employment of AP endonucleases such as Endonuclease 

IV from E. coli appears to be the preferred method of 
phosphodiester bond cleavage. As defined herein, an AP 
endonuclease is any enzyme that catalyzes cleavage of DNA at 
abasic (AP) sites. Such enzymes are present in many species. 

35 Examples of AP endonucleases from Escherichia coli include, 
but are not limited by, endonuclease III and pndonuclease IV. 
Also, E. coli exonuclease III in the presence^ of calcium ions 



WO 97/12061 



PCIYUS96/I5355 



-22- 

is an AP endonucl ase. Enzymes useful in the present 
invention include any enzyme with AP endonuclease-like 
activity, whether it is called by that name or by some other 
name . 

Endonuclease IV cleaves between the 5' -phosphate group of 
the AP residue and the deoxyribose ring of the adjacent 
nucleotide, generating a free 3'-hydroxyl group. in contrast 
to AP endonuclease -catalyzed cleavage, pyrolysis of the 
phosphodi ester bond may occur on either side of the AP 
residue, producing a mixture of 3' -phosphorylated ends on the 
cleavage products. Thus, the result of AP DNA cleavage with 
an AP endonuclease, such as Endonuclease IV, is a set of 
sharper bands after electrophoresis than is obtainable with 
pyrolysis . 

Examination of BESS Products 

Once cleavage has occurred at the abasic sites, one would 
wish to resolve the BESS products. Preferably, this 
resolution will be by gel electrophoresis or capillary 
electrophoresis. Then, the sizes of the fragments will be 
visualized either by staining the DNA molecules or by exposing 
radioactively labelled DNA molecules to X-ray film. The 
fragments may likewise be visualized using non- radioactive 
detection methods known to those of skill in the art. The 
reaction products from 1, 2, 3, 4 or more BESS reactions may 
be visualized in the same lane of an electrophoresis gel or a 
single capillary if different distinguishable non-radioactive 
labels are used for each reaction. 

A suggested protocol for electrophoresis of the BESS 
fragments is: a 0.2 - 0.4 mm gel of 6 or 8% polyacrylamide, 
89 mM tris-borate, pH 8.3, 1 mM EDTA (TBE buffer) and 7 M urea 
is used to separate DNA fragments. Denaturing loading buffer 
(95% formamide, 0.1% bromophenol blue, 0.1% xylene cyanol and 
10 mM EDTA, pH 7.6) is added to samples which were heated at 
85 - 95° for 5 minutes. Then, the samples are electrophoresed 
at 1200 - 2500 volts in TBE buffer. 

i 
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The BESS method has several advantages . The principal 
advantage is the ability to characterize nucleic acid 
molecules at the nucleotide level. This method allows one to 
select specific regions of DNA or RNA for analysis at the 
5 nucleotide level . 

Figure 3 demonstrates the coupling of the BESS protocol 
with an amplification reaction. Note that Figure 3 includes 
the step of incorporation of the non-canonical nucleotide in 
place of a small portion of a canonical nucleotide during PGR 

10 amplification. As described above for non-amplification 

products, the PGR product is exposed to an N-glycosylase that 
excises the non-canonical base at all sites in which it is 
incorporated. The samples are then treated to cleave the 
strands at the abasic sites. Figure 3 demonstrates the 

15 expected products that would occur after strand cleavage and 
demonstrates the analysis of the separated fragments. 

Another advantage of the invention over current 
technologies could be the generation of a pattern of bands 
which may be used to confirm the identity of an amplification 

20 product (see Figure 3, for example) . In some cases in the 

art, a PGR product of the correct molecular size is presumed 
to be derived from the desired template sequence. The DNA 
moiety may in fact be the amplification product of another 
nucleic acid template, and the result is then a "false 

25 positive". If the PGR product fragments generated by this 
invention are compared with a standard set of fragments 
derived from the sequence in question, they may be positively 
identified, even if there are mutations present. A mutation 
is recognizable as a deviation from the normal pattern of 

30 bands. The invention may avoid the occurrence of false 

positive results because the identity of the amplicon may be 
checked from its characteristic banding pattern after N- 
glycosylase digestion and phosphodiester bond breakage. 
Another advantage of the invention is that the 

35 amplification process itself can incorporate non-canonical 
nucleotides which are then detected in the product. 

An especially advantageous feature of the invention is 
that the amplification product does not require further 



A 



10 



15 



20 



WO 57,'i2061 

PCT/US96/1S35S 

-24- 

purification before the positions of the non-canonical 
nucleotide are determined. Many current methods of sequencing 
PCR products involve labor-intensive purification steps, 
followed by an additional single-strand amplification which 
incorporates label and chain terminating nucleotides. 
Purification of the amplicon is usually necessary to remove 
components of the original amplification, such as nucleotides, 
primers, truncated extension products and unwanted non- 
specific products of amplification. The present invention 
eliminates the need for purification steps prior to 
determining a nucleotide's position in the sequence. 

The process described in PCT WO 95/06752 wherein 
exonuclease III digestion can be used to determine the 
presence of alpha-borano nucleotide incorporation sites, also 
enables the direct sequencing of PCR products. However, the 
process generates extra and missing bands in sequencing gels. 
Although the reasons for these artifacts are still uncertain, 
extra bands may be caused by incomplete digestion by 
exonuclease III (exo III). The present invention utilizes an 
N-glycosylase to remove specific bases from DNA, which in turn 
allows chain cleavage at AP sites. Sequencing gel band 
artifacts common with the WO 95/06752 process should not occur 
using the current invention. 



25 



Use of BESS to Determine Kf ncleofiri^ 

By incorporating non-canonical deoxynucleoside 
triphosphates in place of a portion of the four canonical 
deoxynucleoside triphosphates and by using appropriate N- 
glycosylase enzymes, the positions of all four canonical 
nucleoside triphosphates may be determined using the 
30 invention. Therefore, the complete sequence of the nucleic 
acid may be determined using the present invention. In this 
embodiment, there are separate N-glycosylases to remove each 
specific non-canonical nucleotide incorporated into a nucleic 
acid (see Figure 4) . m such a case, there are distinct non- 
canonical nucleotides incorporated into each of four versions 
of the nucleic acid. The four versions of the nucleic acid 
ar<| then separately digested by N-glycosylases specific for 



35 
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each non-canonical nucleotide. After the glycosylase 
digestion and phosphodiester bond cleavage, the fragments are 
separated by gel electrophoresis, capillary electrophoresis or 
- other means . Cleavage of the phosphodiester bonds of the 
5 sugar- phosphate moiety is carried out by non-enzymatic beta- 
elimination or by enzymatic means. If each non-canonical 
nucleotide is a specific substitute for each of the four 
common nucleotides, the complete sequence of said nucleic 
acid may be determined. 

10 Figure 4 describes a method of DNA sequencing in which a 

labelled primer is first annealed to a denatured sample of 
DNA. Separate A, T, G and C- specific reactions are created to 
incorporate an appropriate non-canonical nucleotide in place 
of a small portion of the canonical nucleotide during DNA 

15 synthesis. We note that differently- labelled non-radioactive 
primers may be used for each reaction to permit single lane 
detection. Referring to Figure 4, the symbol Am, Tm, Cm and 
Gm refer to "modified A, " "modified T, " "modified C" and 
"modified G." These "modified bases" are the non-canonical 

2 0 nucleotides . 

Still referring to Figure 4, the products of each 
nucleotide-specif ic reaction are then contacted with an N- 
glycosylase that specifically excises the appropriate non- 
canonical base at all sites in which it is incorporated. 

25 After non-canonical base excision, the products are treated to 
cleave at the abasic sites, preferably using an AP 
endonuclease such as E. coli endonuclease IV. These fragments 
may then be separated, preferably by denaturing polyacrylamide 
gel electrophoresis, by capillary electrophoresis, or by other 

30 methods, and analyzed. The sequence of the template molecule 
may be determined from this comparison, as demonstrated in 
Figure 4 . Figure 4 also demonstrates how the sequence can be 
determined from one- lane electrophoresis if differently 
labelled non- radioactive primers are used for each reaction. 

35 Alternatively, one could perform all the nucleotide- 

^ specific reactions in one large reaction providing that the 

j incorporated non-canonical nucleotides do not cause mutagenic 

events. This reaction can be aliquoted into separate 
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reactions for treatment with the appropriate N-glycosylase . 
The products can then be separated and analyzed as before. 

Non-canonical nucleotides may be incorporated using cycle 
sequencing reactions in which each of four parallel reactions 
contains one non-canonical nucleotide along with the four 
common nucleotides. Each reaction is then digested by an N- 
glycosylase specific for the non-canonical nucleotide 
incorporated. The products obtained after phosphodiester bond 
breakage may then be resolved using denaturing gel 
electrophoresis or other methods as previously described. 

Use of BESS to D etermine Similarities or 
Differences be tween Nucleic Aeiria 

Just as the electrophoretic pattern of DNA fragments 
obtained by digesting an entire DNA molecule with different 
restriction enzymes can be used to characterize and roughly 
identify the DNA molecule (by a restriction map) , the 
electrophoretic pattern of DNA fragments generated using the 
BESS method can be used to characterize and identify nucleic 
acid molecules at the more precise nucleotide level. Thus, 
the BESS method, even using a single non-canonical nucleotide 
and a single N-glycosylase, may be used to determine if and to 
what extent different nucleic acid samples are similar or 
different at the nucleotide level. 

Figure 5 demonstrates the use of the BESS method to 
determine similarities and differences between nucleic acids. 
The method begins with annealing a labelled primer 1 and an 
unlabelled or differently labelled primer 2 to a denatured DNA 
template. (The reaction may also be performed using only a 
single primer . ) 

Non-canonical nucleotides are then incorporated .in place 
of a small portion of a canonical nucleotide using 
in vitro DNA synthesis. Figure 5 demonstrates examples of 
synthesis products from four different DNA template samples to 
be evaluated. The products of each reaction are treated with 
an N-glycosylase that specifically excises the incorporated 
non-canonical base at all sites in which it is incorporated. 
The DNA is then treated so as to cleave at the abasi$ sites, 
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preferably using an AP endonuclease such as E. coli 
endonuclease IV. 

Still referring to Figure 5, the cleavage products of all 
four samples are illustrated. These cleavage samples are 
5 separated, preferably by denaturing polyacrylamide gel 

electrophoresis or another method, as described herein, and 
the labelled fragments are detected. Many bands would be 
generated from most samples and the Figure 5 example shows 
only a few bands for illustrative purposes. 

10 Kit for Characterizing Nucleic Acids 

The present invention is also a kit for characterizing 
nucleic acids. In one embodiment this kit will comprise an 
enzyme capable of extending a primer which is hybridized to a 
nucleic acid template, four canonical deoxynucleoside 

15 triphosphates, at least one non- canonical deoxynucleoside 
triphosphate, an N-glycosylase corresponding to each non- 
canonical nucleotide, and an AP endonuclease. Preferably, the 
AP endonuclease is Endonuclease IV. 

Another form of the kit would be useful for one who was 

20 performing characterization on a specific DNA template. In 
this embodiment of the kit, at least one primer designed to 
hybridize to a short section of the specific DNA template 
would be provided in the kit. The kit may also contain other 
components required to couple BESS with an amplification 

25 method. For example, a kit for coupling BESS with PCR 

amplification of a specific nucleic acid sequence may contain 
two different primers, labelled and/or unlabelled, which flank 
the sequence of interest. 

In a most preferred embodiment of the kit, the enzyme 

3 0 would be a thermostable DNA polymerase. 

Another embodiment of the kit would be designed for DNA 
sequencing applications. In this embodiment, the kit would 
contain four different non-canonical nucleotides designed to 
replace a portion of each of the four different canonical 

35 nucleotides and four different N-glycosylases which catalyze 
specific removal of each of the non -canonical bases following 
their incorporation into DNA. Alternatively, the kit could 
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contain only two or three non-canonical nucleotides, if both 
strands of a specific DNA are to be sequenced. 

EXAMPLES 

Example 1 

This example demonstrates the detection of a single-base 
mutation of clinical significance using the BESS method. The 
example also demonstrates coupling of BESS with PCR. Coupling 
BESS with amplification permits one to obtain results with 
smaller amounts of sample nucleic acid, which may be limited 
in some situations. 

A polymerase chain reaction (PCR) was performed to 
amplify a region of the human S-globin gene contained within 
the plasmids pHB4 or P Sic2. The plasmids were constructed by 
the insertion of a 290 bp segment of the human S-globin gene 
into the vector pT7Blue (Novagen, Madison, WI) . The two 
plasmids differ only by the substitution of an adenosine by a 
thymidine in the sense strand of the globin gene portion of 
pSic2 . Such a mutation in human DNA is the molecular cause of 
the 6 s or sickle globin defect (Kazazian, 1989) . The primers 
used to amplify the globin gene were the standard forward and 
reverse M13 sequencing 24-mer oligonucleotides, 5'- 
CGCCAGGGTTTTCCCAGTCACGAC-3 ' and 5 ' -AGCGGATAACAATTTCACACAGGA-3 ' 
(SEQ ID NOs:5 and 6), respectively. 

100 picograms of plasmid pHB4 or pSic2 DNA was added to 
PCR reactions of 50 microliters, containing 50 mM Tris-HCl pH 
9.0, 20 mM ammonium sulfate, 1.5 mM magnesium chloride, 0.2 mM 
each of dATP, dCTP, and dGTP, 0.16 mM dTTP, 0.04 mM dUTP, 0.1 
microgram of unlabeled forward primer, 0.1 microgram of "P- 
end-labeled reverse primer, and 2.5 units of 2£h. DNA 
polymerase or Tas DNA polymerase (Perkin-Elmer Corporation, 
Branchburg, NJ) . The reactions were heated at 94° for 2 
minutes, cycled at 95° for 30 seconds, 55° for 30 seconds, and 
72» for 30 seconds for 30 repetitions, and finally held at 4°. 

Following the amplification, the sample was treated with 
one unit of UDG (Epicentre Technologies, Madison, WI) at 37° 
for 20 minutes, and the 1 phosphodiester backbone of the 
amplicon was cleaved ati sites of uridine removal by heating at 
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70° for 10 minutes. The sample was diluted with 95% 
formamide, 10 mM EDTA, pH 9.5, 10 mM NaOH and 0,1% each of 
bromophenol blue and xylene cyanol . 

No purification of the reaction products was performed. 
5 Rather, the products of the reaction were heat denatured and 
directly electrophoresed in an 8% polyacrylamide, 8 M urea 
gel. The gel was fixed, dried and exposed to X-ray film. 
Parallel samples containing sequencing reactions for the same 
plasmids were generated using the SequiTherm™ Cycle 
10 Sequencing Kit (Epicentre Technologies, Madison, WI) according 
to the manufacturer's directions for a 32 P-end- labeled reverse 
primer. 

The results show that the UDG-digested products of the 
pHB4 amplicon containing deoxyuridine displays an additional 

15 band at the site of the A to T mutation which is 

characteristic of the sickle globin gene. The corresponding 
sequences obtained by cycle sequencing were electrophoresed in 
adjacent lanes for comparison. The UDG-generated lanes have 
the same pattern of bands as the n T" lane of the cycle 

2 0 sequencing products, except, as expected, the bands from the 
UDG- treated material migrate slightly faster. The mutation of 
the sickle gene is also clearly visible in the cycle 
sequencing lanes. The same results were obtained with either 
Tth DNA polymerase or Tag DNA polymerase . 

25 Example 2 

The following example demonstrates that BESS, coupled 
with an amplif ication method, is capable of detecting a 
clinically-significant single-base mutation, even when the 
mutation is present in only one allele of a heterozygous 

30 individual. The DNA samples used were obtained from humans 
who were either homozygous for the normal allele or 
heterozygous for the normal and mutant alleles. The results 
obtained using BESS were confirmed by sequencing both normal 
and mutant -containing DNA samples using standard methods. 

35 The alpha^antitrypsin gene was amplified from human DNA 

using a protocol similar to that of Example 1. The Z gene 
mutation ^f the alpha^antitrypsin gene is a G-to-A transition 
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in exon five, which leads to an amino acid change of glutamate 
to lysine at position 342. An embodiment of the present 
invention was used to detect the Z gene mutation in one allele 
of a heterozygous MZ individual. 

The »M» form of the alpha, -antitrypsin gene is the normal 
or wild- type condition. Amplification reactions contained 
one-half microgram of human genomic DNA from apparently normal 
or MZ alpha,-antitrypsin heterozygous persons (Coriell 
Institute for Medical Research, Camden, NJ) as the template in 
place of plasmid DNA. The primers, 5 ' -GAGGAGGAGCAAGGCCTATGTG- 
3' and 5 ' -GGGATTCACCACTTTTCCCATG-3 ' , (SEQ ID NOs:l and 2) were 
chosen to amplify parts of intron four and exon five of the 
alpha.-antitrypsin gene (Cox, 1990) . The reactions contained 
50 mM Tris-HCl pH 9.0, 20 mM ammonium sulfate, 1.5 mM 
magnesium chloride, 0.2 mM each of dATP, dCTP, and dGTP, 0.16 
mM dTTP, 0.04 mM dUTP, 0.1 microgram of each of the primers 
described above, and 2.5 units of Tag DNA polymerase (Perkin- 
Elmer, Branchburg, NJ) . 

The reactions were submitted to 35 cycles consisting of 
91° for 1.5 minutes, 55° for 30 seconds, and 68° for 1 minute, 
then were held at 4°. The amplicons resulting from 
amplification in the presence of dUTP were treated with UDG 
and pyrolyzed as described in Example 1. 

Without further purification steps, both the UDG- treated 
amplicons and the cycle sequence reaction samples were 
electrophoresed in a polyacrylamide-urea gel as described in 
Example 1. The MZ heterozygous DNA yielded an additional »T" 
band at the position of the mutation which is one-half the 
intensity of the other T bands, because only one amplified 
allele contains the Z mutation. Using BESS, the AAT Z 
mutation was detectable in the amplification products obtained 
from a mixture of normal and mutant alleles in chromosomal 
DNA. 

As a control display of the Z mutation, the PCR products 
of both normal and MZ heterozygote genes were sequenced. 
Amplicons synthesized without dUTP and containing only the 
canonical four deoxynucleotides were electrophoresed in a 
discontinuous 13% polyacrylamide gel containing sodium dodecyl 
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sulfate detergent. The 310 bp amplicon was localized by- 
staining with ethidium bromide, and cut from the gel. The PCR 
products were purified by crushing and soaking the gel 
fragment in 0.5 M ammonium acetate, 1 mM EDTA, and 0.1% SDS 
5 using methodology known to the art. This DNA was used as the 
template to generate cycle sequencing data using 32 P end- 
labeled primers as described in the product information for 
SequiTherm™ Cycle Sequencing Kits (Epicentre Technologies) . 
Standard cycle sequencing performed as described in Example 1 
10 confirmed the sequence of the normal and mutant DNA samples 
that were obtained using BESS coupled with PCR. 

Example 3 

The effectiveness of endonuclease IV digestion of AP DNA 
generated using BESS is demonstrated in Example 3 . As a 

15 standard of reference, the T lane of the SequiTherm- sequenced 
AAT PCR product (Example 2) was compared with the results of 
BESS in several of its embodiments. The first embodiment was 
UNG digestion of dU-substituted PCR product (37° for 10 
minutes) followed by pyrolysis (90° for 20 minutes) to cleave 

20 the phosphodiester bonds at AP sites. The second is the same 
as the first, except that endonuclease IV was added 
concurrently with UNG, and no 90° heat step followed. The 
third is the same as the second, except that EDTA was added to 
10 mM final concentration prior to UNG and endonuclease IV 

25 addition. Several conclusions may be reached from the 
experiment : 

(1) The products of the pyrolysis of AP DNA result in 
electrophoretic bands that are less sharp than those resulting 
from endonuclease IV cleavage of the same amplification 

3 0 product . 

(2) The endonuclease IV reaction allows the polymerase 
to extend, in some cases, the cleavage product 3'-hydroxyl 
ends, resulting in spurious bands. The addition of a 
chelating agent such as EDTA before or concurrently with UNG 

35 and endonuclease IV prevents the polymerase from extending tfye 
cleavage product ends and eliminates the spurious bands. ,| 

(3) As expected, the BESS products migrate more rapidly 
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during electrophoresis than the corresponding products from 
Sanger cycle sequencing reactions using SequiTherm™. The 
migration difference is more apparent at the bottom of the 
electrophoresis gel. 

(4) Art if actual bands obtained in cycle sequencing of 
the PCR products were eliminated using the invention. More 
accurate sequence information may be obtained using the 
invention in its preferred embodiments than using methods nc 
known to the art. 



Example 4 

The second exon of the DQBl gene of the human HLA class 
II complex is a highly polymorphic region (Bugawan and 
Ehrlich, 1991; Thorsby and Ronningen, 1993). Analysis of the 
HLA regions is useful for tissue typing prior to tissue or 
organ transplantation or for identity testing in forensics or 
other fields. To demonstrate the usefulness of BESS to 
distinguish QBl alleles, several genotypes of the HLA gene 
were cloned into a P UC19 plasmid vector and subjected to 
analysis using BESS. In this example, incorporation of a non- 
canonical nucleotide into DNA according to the BESS procedure 
was carried out using only a single primer and multiple rounds 
of DNA synthesis in a thermocycler . Thus, only a linear 
amplification of only one strand of the nucleic acid template 
was obtained, in a manner similar to cycle sequencing. 
Therefore, the BESS procedure was used to linearly amplify 
single-stranded copies of DQBl genes from the plasmids. As 
described previously, dU was partially substituted for dT in 
the single-stranded amplification products. Following 
digestion with UNG and endonuclease IV, the products were 
electrophoresed in all lanes adjacent to »T« reaction products 
obtained from standard cycle sequencing of the same plasmids. 

Primers DB130 (5 ' -AGGGATCCCCGCAGAGGATTTCGTGTACC-3 ' ) and 
GH2 9 ( 5 ' -GAGCTGCAGGTAGTTGTGTCTGCACAC- 3 • ) ( SEQ ID NOs : 3 and 4 ) 
(Bugawan and Ehrlich, 1991) were used to amplify exon 2 of the 
DQBl gene from human peripheral blood DNA (Novagen, Madison, 
WI) . The blood samples from five individuals. 'were used for 
the DNA preparation, which should contain mani alleles of the 
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HLA gene target. Fifty microliter reactions containing 50 mM 
Tris-HCl pH 9.0, 2 0 tnM ammonium sulfate, 1 . 5 mM magnesium 
chloride, 0.2 mM each of dATP, dGTP, dTTP and dCTP, 0,1 
microgram each of DB130 and GH29 primers, 0.5 microgram human 
5 DNA, and 1 unit of Tag DNA polymerase ( Perkin- Elmer , 

Branchburg, NJ) were amplified in an MJ Research DNA Engine 
thermal cycler. The samples were denatured at 94° for 3 
minutes, then subjected to thirty cycles of 93° for 10 seconds 
and 60° for 40 seconds, followed by 75° for 3 minutes. The 

10 thermal cycler was programmed to begin timing when the sample 
reaches the designated temperature. 

The resulting 250 bp PCR product was digested with 
restriction endonucleases Bam HI and Pst I, which cleave the 
product within its primer-encoded regions. The PCR product 

15 was purified by polyacrylamide gel electrophoresis and elution 
from gel fragments. Plasmid pUC19 was digested with Bam HI 
and Pst I, treated with HK Phosphatase (Epicentre 
Technologies, Madison, WI) , and ligated with the PCR product. 
Ampicillin-resistant colonies were screened for the presence 

2 0 of the PCR product in their plasmids by PCR using standard 

forward ( 5 ' - CGCCAGGGTTTTCCCAGTCACGAC- 3 ' ) and reverse ( 5 ' - 
AGCGGATAACAATTTCACACAGGA-3 ' ) M13 primers (SEQ ID NOs : 5 and 6). 
Candidate bacterial colonies were swabbed with a polypropylene 
pipette tip and added to PCR mixtures which were cycled as 
25 above in a DNA Engine thermal cycler. PCR products encoding 
HLA genes were mapped by restriction endonuclease digestion 
(Mitsunaga, et al . # 1995) Three unique clones were selected 
for further analysis using BESS. 

For BESS analysis, DNA synthesis was primed from the 

3 0 three HLA clones in a manner analogous to cycle sequencing, 

but with a mixture of dGTP, dCTP, dTTP, dATP and dUTP as 
previously described in place of a dideoxy- termination 
mixture. The single- stranded products of thirty cycles of 
linear amplification were subjected to UNG and endonuclease IV 
35 digestion in the presence of 10 mM EDTA as previously 
described, and were electrophoresed in a denaturing 
polyacrylamide gel. For comparison, the nucleotide sequences 
of the same three clones were determined using the SequiTherm 
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Cycle Sequencing Kit (Epicentre Technologies, Madison, WI) . 

The BESS technique gave a unique set of bands for each 
HLA genotype, well correlated with the T sequencing lanes 
obtained by cycle sequencing each clone, except that, as 
expected and as in the previous examples, the BESS-derived 
bands migrated slightly more rapidly in the gel than the 
corresponding dideoxy- sequencing bands. The sequence- related 
data obtained using BESS was even more accurate than that 
obtained using standard cycle sequencing. Thus, the presence 
of dideoxy-sequencing artifacts in one region of the gel that 
were not present in the BESS products lanes aided the correct 
interpretation of the DNA sequence data. From this example, 
it is clear that the invention is not limited to sequence 
screening of PGR products or of double- stranded DNA molecules. 

A kit may be made for determining the HLA genotype of 
human DNA samples using BESS. A kit containing the reagents 
listed below could be used in conjunction with a thermostable 
DNA polymerase such as Tth. or Tag DNA polymerase (Perkin-Elmer 
Corp.) in order to perform PCR. The kit might contain: HLA- 
specific primers at 0.1 micrograms per microliter; a dNTP 
mixture containing 2.5 mM each of dGTP, dCTP and dATP, 2 mM 
dTTP and 0.5 mM dUTP; a DNA polymerase buffer containing i.o M 
Tris-HCl, pH 9.0, and 400 mM ammonium sulfate; 1 unit per 
microliter of uracil-DNA glycosylase; 50 mM EDTA; 25 mM 
magnesium chloride; 1 unit per microliter of E. coli 
Endonuclease IV; and a Stop Buffer containing 95% formamide 
10 mM EDTA, pH 9.5, lOmM NaOH and 0.1% each of bromophenol ' 
blue and xylene cyanol. 

The kit could be used in a similar manner to methods 
described in Examples 1 through 4 and Figs. 1 through 5. 
Specifically, the-DQBl group of HLA genes could be amplified 
from human DNA obtained from peripheral blood or cheek 
(buccal) swabs in a 50 microliter mixture containing 50 mM 
Tris-HCl, pH 9.0, 20 mM ammonium sulfate, 1.5 mM magnesium 
chloride, 0.2 mM each of dATP, dGTP and dCTP, 0.16 mM dTTP, 
0.04 mM dUTP, 0.1 micrograms each of primers DB130 and GH29 
(SEQ ID NOs:3 and <4) , approximately 0.5 micrograms of human 
DNA, and 1 unit of: Tag; DNA polymerase (Perkin-Elmer) . The 
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mixture could be denatured at 94 °C for three minutes, and then 
cycled thirty times at 93 °C for 10 seconds and 60°C for 40 
seconds and then ten minutes at 75°C. Then, after addition of 
EDTA to a final concentration of 10 mM, the amplification 
5 products could be cleaved by treatment with 1 unit each of 
uracil -DNA glycosylase and Endonuclease IV for ten minutes at 
room temperature. Following addition of the Stop Buffer, the 
samples could be denatured by incubating for five minutes at 
95°C and then analyzed by electrophoresis in an 8% 

10 polyacrylamide-8 M urea gel. The migration of the bands 
generated from samples using the BESS procedure could be 
compared to BESS reaction products from cloned DQB1 genes, 
perhaps included as a control in the kit, or with DNA size 
standards, which could also be included in the kit. The 

15 primers in the kit may be labelled by some means, preferably 
by a non- radioactive means, and an automated scanner and/or 
image analysis system could be used to compare the band 
patterns of samples with the controls. 
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(2) INFORMATION FOR SEQ ID NO:l: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (oglionucleot ide) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:l: 
GAGGAGGAGC AAGGCCTATG TG 

(2) INFORMATION FOR SEQ ID NO:2: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (oglionucleotide) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:2: 
GGGATTCACC ACTTTTCCCA TG 

(2) INFORMATION FOR SEQ ID NO: 3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 29 base pairs 

(B) TYPE: nucleic acid 

. (C) STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (oligonucleotide) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:3: 
t AGGGATCCCC GCAGAGGATT TCGTGTACC 

i 
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(2) INFORMATION FOR SEQ ID N0:4: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY : 1 inear 

(ii) MOLECULE TYPE: DNA (oligonucleotide) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:4: 

GAGCTGCAGG TAGTTGTGTC TGCACAC 27 



(2) INFORMATION FOR SEQ ID NO: 5: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (oligonucleotide) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:5: 

CGCCAGGGTT TTCCCAGTCA CGAC 24 

(2) INFORMATION FOR SEQ ID NO: 6: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (oligonucleotide) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 

AGCGGATAAC AATTTCACAC AGGA 24 
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CLAIMS 

We claim: 

1. A method for characterizing a nucleic acid molecule 
comprising: 

a) synthesizing DNA in the presence of a reaction 
mixture comprising 

5 (i) a nucleic acid template, 

(ii) a primer molecule that contains a nucleotide 
sequence at least part of which is sufficiently complementary 
to a portion of the template to hybridize therewith, 

(iii) an enzyme that extends the primer so that a DNA 
10 molecule may be synthesized, 

(iv) four canonical deoxynucleoside triphosphates, and 

(v) at least one non-canonical deoxynucleoside 
triphosphate, wherein the non-canonical deoxynucleoside 
triphosphate is incorporated into the synthesized DNA in place 

15 of only one canonical deoxynucleoside triphosphate; 

b) contacting the synthesized DNA with an N-glycosylase 
that excises a base portion of the non-canonical 
deoxynucleoside triphosphate from said synthesized DNA, 
whereby an abasic site is created; 

20 c) treating the DNA in such manner that said treatment 

results in breakage of a phosphodiester backbone at all abasic 
sites, whereby at least two DNA fragments are created; and 
d) separating the fragments according to size. 

2. The method of claim 1 wherein the fragments are 
detected by transferring said size -separated fragments to a 
solid support and then revealing the presence of said 
fragments by hybridization thereto of a labelled complementary 

5 probe . 

3 . The method of claim 1 wherein the synthesized DNA is 
labelled with a detectable probe. 

4. The method of claim 1 wherein the template is a DNA 
molecule and the enzyme of step (a) is a DNA-dependent DNA ' 
polymerase . «! 
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5 . The method of claim 4 wherein the enzyme is 
thermostable . 

6 . The method of claim 1 wherein the template is an RNA 
molecule and the enzyme of step (a) is an RNA-dependent DNA 
polymerase . 

7 . The method of claim 6 wherein the enzyme is 
thermostable. 

8. The method of claim 1 wherein the non-canonical 
deoxynucleoside triphosphate is 2 ' -deoxyuridine-5 ' -triphosphate 
and the N-glycosylase is uracil -N-glycosylase . 

9. The method of claim 1 wherein the non-canonical 
deoxynucleoside triphosphate is 2 ' -deoxyinosine-5' -triphosphate 
and the N-glycosylase is hypoxanthine-N-glycosylase . 

10. The method of claim 1 wherein a non-canonical 

base is modified after incorporation into the synthesized DNA 
by treating the synthesized DNA in order to make the modified 
base a non- canonical base which is_susceptible to loss upon 
treatment of the synthesized DNA with an N-glycosylase. 

11. The method of claim 1 additionally comprising 

the step of providing three additional DNA synthesis reactions, 
to provide a total of four DNA synthesis reactions, wherein a 
different non- canonical deoxynucleoside triphosphate is used 
5 in each reaction. 

12 . A method for determining the complete nucleotide 
sequence of a nucleic acid template by analyzing the nucleic 
acid fragments of claim 11. 

13. The method of claim 1 wherein step (c) comprises 
heating the DNA to break the phosphodiester backbone at all 
abasic sites. ; 
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14. The method of claim 1 wherein step (c) comprises 
using a basic solution to effect the breakage of the 
phosphodiester backbone at all abasic sites. 

15. The method of claim 1 wherein step (c) comprises 
using an enzyme to effect the breakage of the phosphodiester 
backbone at all abasic sites. 

16. The method of claim 15 wherein step (c) 
comprises using an AP endonuclease to effect the breakage 
of the phosphodiester backbone at all abasic sites. 

17. The method of claim 1 additionally comprising 
the step of providing at least two different non- canonical 
nucleotides in the synthesizing step, wherein the reaction is 
divided into separate aliquot s in the contacting step so that 

5a different N-glycosylase is provided for each aliquot. 

18. The method of claim 1 wherein DNA synthesis is 

a part of a method for amplification of the template nucleic 
acid. 

19. The method of claim 1 wherein DNA synthesis is 

a part of amplification of a template nucleic acid using the 
Polymerase Chain Reaction. 

20. The method of claim 1 wherein DNA synthesis is 
a part of a NASBA reaction. 

21. The method of claim 1 wherein DNA synthesis is 
a part of 3 SR. 

22. The method of claim l wherein DNA synthesis is 
a part of Strand Displacement Amplification. 

23. The method of claim l in which electrophoresis 
is used to separate the DNA 'fragments . 

i 
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24. A kit for characterizing a nucleic acid template, 
comprising 

an enzyme capable of extending a primer bound 
to a nucleic acid template so that a DNA molecule may be 
5 synthesized, four canonical deoxynucleoside triphosphates, 
at least one non- canonical deoxynucleoside triphosphate, an 
N-glycosylase capable of excising the base portion of each 
respective non-canonical deoxynucleoside triphosphate from 
said synthesized DNA, and an AP endonuclease . 

25. The kit of claim 24 additionally comprising at 
least one primer molecule. ' 
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Sample Nucleic Acid Template, usually present in muUipIt copies 

I DNA Polymerase 

Divalent I dATP, dCTP, dGTP, dTTP 

cation \I/ and a non-canonical nucleotide 

▼ (e.g.dUTP) 

* — i U 

-a u u \ Example of 

/ 3 Products 

+ — ^ u ' 

I N-Glycosylase 
\Af (e.g. Uracil DNA Glycosylase) 

* * ♦ 1 Uracil 

* — * " + 2 Uracils 



© Anneal labeled Primer to DNA template. 

(2) Incorporate the appropriate non-canonical 
nucleotide in place of a small portion of a 
canonical nucleotide during in vitro DNA 
synthesis. 



© Contact the in Wf/o-synthesized DNA 
with an N-gtycosylase that excises the 
non-canonical base at alt sites in which 
it is incorporated. 



* — * — + , Uraci) 



Strand cleavage 

(e.g. using £ coii Endonuclease IV) 



0 



±) After non-canonical base excision, treat 
the DNA so as to cleave it at abasic sites. 



* — x. 



} X3 



Separate Fragments 
Detect labeled Fragments 




(5) Separate the fragments 
(e.g. by denaturing PAGE). 

® Detect the labeled fragments (e.g.s X-ray 
film for a radioactivety-labeled pnmer or a 
fluorescent detector for a fluorescentiy- 
labeled primer). 
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Wild-type Template DNA 



3' 



Primar-t 



5' 3' 



Template with T-tC Mutation 
PrimeM 

r, 



5' 



(D Denature sample DNA and 
anneal labeled Primers. 



5' 



3' 5 1 



Priroor-2 



Primw-2 



(2) Incorporate an appropriate 
non-canonical nucleotide (e.g. 
dUTP) in place of a small 
portion of a canonical nucleo- 
tide during DNA synthesis. 



Example of products obtained 
as a result of DNA synthesis from 
Primer- 1 on a Wild-type template 



Example of products obtained as a 
result of DNA synthesis from Primer-1 
on a template containing a mutation 



♦-Labeled *-tabeled 
Size Wild-type Mutated 
Standards Samples Samples 



Contact the products with an 
N-gtycosylase (e.g. Uracil DNA 
Glycosylase) that excises the 
non-canonical base at all sites 
In which it fs incorporated. 

After non-canonical base 
excision, treat the DNA so as 
to cleave it at abasic sites (e.g. 
using £ coti Endonuclease IV). 



® Separate the fragments 
(e.g. by denaturing PAGE). 

© Detect the labeled fragments (e.g. using an 
infrared fluorescent detector for a primer 
labeled with an infrared fluorescent dye). 



Although multiple fragments are obtained as a result 
of cleavage at all abasic sites where a non-canonical 
base was removed by an N-glycosylase, the example 
at (he left depicts any the portion of the gel where a 
mutation is observed. 



NOTE: The BEStf 1 * procedure may be performed 
simultaneously using 2 different primers in the same 
reaction if distinguishable differently-labeled primers 
are used. 
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Primer- 1 

3' ...^.TTT^. 



Sample Nucleic Add Template 



Pmner-2 



Divalent 
cation 

* — ^~ 



l DNA Polymerase 
I dATP, dCTP, dGTP, dTTP 



and a non-canonical nucleotide 
<e.g. dUTP) 

U 



5* 

® 



Denature sample DNA and anneal labeled 
Primer-1 and an unlabeled or difierently- 
labeled Primer-2 to DNA template. 



Incorporate the appropriate non-canonical 
nucleotide in place of a small portion of a 
canonical nucleotide during PCR amplification. 
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-U- 



-u- 
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Example of 
3 Products 



* — 



-U- 



-U- 



* — 



* — 



* — 



* — 



* — 



N-Glycosylase 

(e.g. Uracil DNA Glycos/tase) 



— • 



♦ 1 Uracil 
+ 3 Uracils 

♦ 3 Uracils 



Strand cleavage 

(e.g. using £. cofi Endonuclease IV) 



— • 



© 



Contact the PCR product with an N-glycosylase 
that excises the non-canonical base at all sites 
in which it is incorporated. 



After non-canonical base excision, treat 
the DftA so as to cleave it at abasic sites. 



* — 



-x — • 



Separate Fragments 
Detect Labeled Fragments 



Size * -Labeled • Labeled 
Standards Samples Samples 



© Separate the fragments 
(e.g. by denaturing PAGE). 

Detect the labeled fragments (e.g.s X-ray 
film for a radioactivery-labeled pnmer or a 
fluorescent detector for a f luorescently- 
labeled primer). Multiple differently-labeled 
primers may be detected in the same lane 
if they can be detected differentially. 
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Primer- 1 
* * 



c 3* 

5 Sample Nucleic Add Template 

I DNA Polymerase 

Divalent I <JATP ( dCTP. dGTP. dTTP 
cation ^# an d a non<anonical nucleotide. 

• — — Am— .. '.77. . . 

» i — ^"vrn r. y 

3 ' Tm *Tr7TTTr. A r. 

'laws ' 

» — fc —Cm— — ~ 

*— ^Cm— j-. 

T 4 _ fc Gm 

t .rr? * 

Clea£!e*Su«s Ctea^Products 



© Denature sample C 
anneal labeled Prir 



jDNA and 
(Primer. 

(2) In separate A, T, G and C-specific reactions, 
incorporate the appropriate non-canonical 
nucleotide in place of a small portion of a 
canonical nucleotide during DNA synthesis 
(e.g. -modified A* (Am), "modified r fTm), 
^modified C* (Cm) and "modified G" (Gm)). 
Differently-labeled non-radioactive primers 
may be used for each reaction to permit 
single-lane detection. 



(3) Contact the products of each reaction set 
with an N-glycosylase that specifically 
excises the appropriate non-canonical base 
at all sites In which K is incorporated. 

® After non-canonical base excision, treat 
the DNA so as to cleave it at abastc sites 
(e.g. using £ coff Endonuclease lv)- 



C*specific 
Cleavage Products 



G-spectfic 
Cleavage Products 



(D Separate the fragments 
(e.g. by denaturing PAGE). 



T-Speclfic A-SpecKIc ^Specific G-Specrflc 
Reactions Reactions Reactions Reactions 



Combined 4-Label 
1-Lane Reactions 




Detect the labeled fragments (e.g.s 
in 4 separate lanes using X-ray film 
for a radioactively-labeled primer or 
another appropriate method for a 
non-radioactive label; or in 1 lane 
using a method that can differentially 
detect 4 differently-labeled non-radio- 
active primers). 



Sequence would be read from 
bottom to top of gel as: 

5'-CAATGCTG-3' 
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Pnmar-i 
* * 



Sample Nucleic Acid Template 



Prime/*2 



Divalent 
cation 



DNA Polymerase 
dATP, dCTP, dGTP, dTTP 
and a non-canonical nucleotide 
(e.g. dUTP) 



5* ® For each template to be screened, denature 
sample DNA and anneal labeled PrimeM and 

3* an unlabeled or differently-labeled Primer-2 to 

DNA template. Reactions may also be 
performed using only a single primer. 

(D Incorporate the appropriate non-canonical 
nucleotide in place of a small portion of a 
canonical nucleotide during in vitro DNA 
synthesis. 



-u- 



— U- 



Example or a DNA synthesis 
product from Sample- 1 

Example of a DNA synthesis 
product from Sample-2 

Example of a DNA synthesis 
product from Sample-3 

Example of a DNA synthesis 

product from Sample-4 /s\ . fc 

Contact the products of each reaction set 
with an N-glyoosylase that specifically 
excises the non-canonical base at all 
sites in which it is incorporated. 

After non-canonical base excision, treat 
the DNA so as to cleave it at abasic sites 

Example of a cleavage (e.g. using £ coff Endonuctease IV). 

product from Sample- 1 

Example of a cleavage 
product from Sample-2 

Example of a cleavage 
product from Sample-3 



Example of a cleavage 

-«r— • product from Sampie-4 



4. 



(D Separate the fragments 
(e.g. by denaturing PAGE). 



StM ♦-Uteied *.ue«led 
Standard* Semple-l Sample* 



♦- Ub ilid ••Label** e-LaM»d e-Ubated •■ 
Swnple-4 SempJe-l S«npta-2 Sentplt-J 




© 



Detect the labeled fragments 
(e.g.s in separate lanes using 
X-ray film for a radioactively- 
labeled primer or another 
appropriate method for a non- 
radioactive label; or in 1 lane 
using a method that can 
differentially detect 2 or more 
differently-labeled 
non-radioactive primers). 



While many bands would be 
generated for most samples, 
the example to the left shows 
only a few bands for illustrative 
purposes. 
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